Generalization in the Face of Adaptivity: A Bayesian Perspective
Repeated use of a data sample via adaptively chosen queries can rapidly lead to overfitting, wherein the empirical evaluation of queries on the sample significantly deviates from their mean with respect to the underlying data distribution. It turns out that simple noise-addition algorithms suffice to prevent this issue, and differential privacy-based analysis of these algorithms shows that they can handle an asymptotically optimal number of queries. However, differential privacy's worst-case nature entails scaling such noise to the range of the queries even for highly concentrated queries, or introducing more complex algorithms. In this paper, we prove that straightforward noise-addition algorithms already provide variance-dependent guarantees that also extend to unbounded queries. This improvement stems from a novel characterization that illuminates the core problem of adaptive data analysis. We show that the harm of adaptivity results from the covariance between the new query and a Bayes factor-based measure of how much information about the data sample was encoded in the responses given to past queries. We then leverage this characterization to introduce a new data-dependent stability notion that can bound this covariance.
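As a minimal sketch of the kind of noise-addition mechanism the abstract refers to (the function `noisy_answer`, the threshold queries, and the `noise_scale` value below are illustrative assumptions, not the paper's actual algorithm):

```python
import numpy as np

rng = np.random.default_rng(0)

def noisy_answer(sample, query, noise_scale):
    """Answer a statistical query on the sample, with Gaussian noise added
    to limit how much each response reveals about the sample."""
    empirical_mean = np.mean([query(x) for x in sample])
    return empirical_mean + rng.normal(0.0, noise_scale)

# Toy illustration: an adaptively chosen sequence of queries, where each
# query depends on the previous (noisy) answer.
sample = rng.normal(0.0, 1.0, size=1000)
answers = []
for _ in range(5):
    shift = answers[-1] if answers else 0.0
    query = lambda x, s=shift: float(x > s)  # threshold query at the last answer
    answers.append(noisy_answer(sample, query, noise_scale=0.05))
```

The point of the abstract is that the noise scale in such a mechanism can be tied to the queries' variance rather than their worst-case range.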
A Bayesian Perspective on Training Speed and Model Selection
We take a Bayesian perspective to illustrate a connection between training speed and the marginal likelihood in linear models. This provides two major insights: first, that a measure of a model's training speed can be used to estimate its marginal likelihood. Second, that this measure, under certain conditions, predicts the relative weighting of models in linear model combinations trained to minimize a regression loss. We verify our results in model selection tasks for linear models and for the infinite-width limit of deep neural networks. We further provide encouraging empirical evidence that the intuition developed in these settings also holds for deep neural networks trained with stochastic gradient descent. Our results suggest a promising new direction towards explaining why neural networks trained with stochastic gradient descent are biased towards functions that generalize well.
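The linear-model connection the abstract describes can be illustrated with a standard identity: the log marginal likelihood of a conjugate Gaussian linear model decomposes into a sum of one-step-ahead log predictive probabilities, i.e., a measure of how quickly the model fits each new data point. The following sketch verifies that decomposition numerically (the dimensions, prior scale `alpha`, and noise level `sigma2` are arbitrary choices for illustration):

```python
import numpy as np

rng = np.random.default_rng(1)

# Conjugate Bayesian linear model: y = X w + eps, w ~ N(0, alpha I), eps ~ N(0, sigma2 I).
n, d = 20, 3
X = rng.normal(size=(n, d))
y = X @ rng.normal(size=d) + 0.1 * rng.normal(size=n)
alpha, sigma2 = 1.0, 0.01

# Exact log marginal likelihood: y ~ N(0, alpha X X^T + sigma2 I).
K = alpha * X @ X.T + sigma2 * np.eye(n)
_, logdet = np.linalg.slogdet(K)
log_ml = -0.5 * (y @ np.linalg.solve(K, y) + logdet + n * np.log(2 * np.pi))

# "Training speed" view: the same quantity as a sum of sequential
# log predictive probabilities, one per data point.
log_pred_sum = 0.0
for i in range(n):
    Ki = alpha * X[:i] @ X[:i].T + sigma2 * np.eye(i)
    k_star = alpha * X[:i] @ X[i]
    mu = k_star @ np.linalg.solve(Ki, y[:i]) if i else 0.0
    var = alpha * X[i] @ X[i] + sigma2 - (k_star @ np.linalg.solve(Ki, k_star) if i else 0.0)
    log_pred_sum += -0.5 * (np.log(2 * np.pi * var) + (y[i] - mu) ** 2 / var)

# log_ml and log_pred_sum agree up to numerical error.
```

A model that fits each successive point quickly (high sequential predictive probabilities) therefore has a high marginal likelihood, which is the intuition behind using training speed for model selection.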
Review for NeurIPS paper: A Bayesian Perspective on Training Speed and Model Selection
Weaknesses: At Eq. 5, the authors introduce two sampling-based estimators of the lower bound (LB). I am not sure why the authors introduced both as estimators for the LB: the second estimator is an unbiased estimator of the (log) marginal likelihood (ML). Though it could technically be considered a biased estimator of the LB, I do not see why it should be introduced as such, since it is an unbiased estimator of the exact value the authors are hoping to approximate. Indeed, in the following sentence the authors write that the second estimator's bias decreases as J is increased, which is very much expected, if not almost trivial, considering the point above. Another point is that when J = 1 the two estimators are algebraically the same; therefore the first one also becomes a (noisy) unbiased estimator of the ML.
Review for NeurIPS paper: A Bayesian Perspective on Training Speed and Model Selection
The work considers SGD training for Bayesian linear models and illustrates a connection between training speed and generalization, and why SGD tends to select simpler models. In particular, the work shows that a particular type of posterior sampling from gradient descent yields the same model rankings as those based on the true posterior, under suitable assumptions. Experiments on deep nets are also presented. The reviewers liked the work overall, but felt that some aspects of the exposition were unclear, that the transition to and implications for deep nets are not quite convincing (especially since there is now a better understanding of both optimization and generalization in deep nets), and that baseline comparisons (e.g., SGLD, L2 regularization, dropout) would strengthen the work.
End-to-End Learning for Stochastic Optimization: A Bayesian Perspective
Rychener, Yves, Kuhn, Daniel, Sutter, Tobias
We develop a principled approach to end-to-end learning in stochastic optimization. First, we show that the standard end-to-end learning algorithm admits a Bayesian interpretation and trains a posterior Bayes action map. Building on the insights of this analysis, we then propose new end-to-end learning algorithms for training decision maps that output solutions of empirical risk minimization and distributionally robust optimization problems, two dominant modeling paradigms in optimization under uncertainty. Numerical results for a synthetic newsvendor problem illustrate the key differences between alternative training schemes. We also investigate an economic dispatch problem based on real data to showcase the impact of the neural network architecture of the decision maps on their test performance.
- Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
[R] A Bayesian Perspective on Q-Learning
Sounds good, looks like I'll be making another exposition then! In terms of making interactive documents like this, you have a few options; I'll list them in order of easiest to hardest (assuming you code in Python and don't know much web dev). One option is notebook forms: you can set up various toggles to run your visualizations. The one drawback is that it's not as interactive in "real time", because every time you reconfigure the parameters you have to re-run the cell to show the results. If you're interested in this approach, just add a cell, click on the three dots, and then click "Add a form".
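The "Add a form" workflow described above corresponds to Colab's `#@param` annotations, which turn ordinary assignments into form widgets; the cell body runs as plain Python either way. The parameter names below are made up for illustration:

```python
# In a Colab cell, "Add a form" attaches widgets via #@param comments.
# Changing a control updates the assignment; re-run the cell to refresh results.
learning_rate = 0.01  #@param {type:"slider", min:0.001, max:0.1, step:0.001}
n_samples = 100  #@param {type:"integer"}

print(f"re-running with lr={learning_rate}, n={n_samples}")
```

Outside Colab the `#@param` tags are inert comments, so the same cell still runs as a normal script.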